Czech language database of car speech and environmental noise

نویسندگان

  • Petr Pollák
  • Josef Vopièka
  • Pavel Sovka
چکیده

This paper will present new Czech language twochannel (stereo) speech database recorded in car environment. The created database was designed for experiments with speech enhancement for communication purposes and for the study and the design of a robust speech recognition systems. It respects car noise environment which is currently at the top of the interest. Tools for automated phoneme labelling based on Baum-Welch re-estimation were designed. The noise analysis of the car background environment was done.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Three Czech Speech Databases from the Standpoint of Lombard Effect Appearance

This paper focuses on three Czech speech databases recorded in actual and simulated noisy conditions and explores their suitability for LE analysis and modeling. Parameters of Czech SPEECON, CZKCC car database and newly established Czech Lombard Speech Database (CLSD) are compared. All three databases comprise speech recorded in neutral conditions and speech uttered in noise of the moving car. ...

متن کامل

Speech Recognition in the Automobile

Acknowledgments Chapter 1: Introduction Chapter 2: The SPHINX Speech Recognition System 1 2 3 5 2.1 Signal Processing ............................ 5 2.2 Clustering and Vector Quantization ..................... 6 2.3 Hidden Markov Models .......................... 7 2.4 Speech Unit ............................... 7 Chapter 3: The Motorola Car Database and AN4 Database 8 3.1 The Motorola Car Data...

متن کامل

Design and collection of Czech Lombard speech database

In this paper, design, collection and parameters of newly proposed Czech Lombard Speech Database (CLSD) are presented. The database focuses on analysis and modeling of Lombard effect to achieve robust speech recognition improvement. The CLSD consists of neutral speech and speech produced in various types of simulated noisy background. In comparison to available databases dealing with Lombard ef...

متن کامل

Reduced complexity equalization of lombard effect for speech recognition in noisy adverse environments

In real-world adverse environments, speech signal corruption by background noise, microphone channel variations, and speech production adjustments introduced by speakers in an effort to communicate efficiently over noise (Lombard effect) severely impact automatic speech recognition (ASR) performance. Recently, a set of unsupervised techniques reducing ASR sensitivity to these sources of distort...

متن کامل

SPEECHDAT-CAR. A Large Speech Database for Automotive Environments

The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999